Nucleotide sequence of the S. clavuligerus chromosome including and 
flanking cas1 (SEQ ID NO:1) 



NcoI 

1 

GGTACCGCCCGCCGCCGACGGGGCCTCGGAGCCGGCCTGGCCACTGGTCCTGGTGGGGCC 60 

MAPPPQGPAEAPGTVLVVG 



61 

ACCCTATCACCGGGCGGTGGGCCGCGTCGTCTGAGGGCCTGTGCCTGGGCACCCACACGC 120 
TPYHGAVRRLLSGSVSGHTH 

<orfup3 

121 

GCCTTTCCGGGCCTCCGGCCCAGTGTCGGTGCCCATTGCGCGCCACAGGAACGGGCGCAT 180 

*LWPYRATDKGA 

Y 

ASLGPPRTM 



181 

TAGCCCCAGGTCTATCTGCTTCCGGGCCACCTGCTCCTTCAGGGCGTGGAGCATCTGGCA 240 
DPDLYVFARHVLFDRVEYV 

T 



241 

CGTGGTCGCGGGCCGCCGGGTGAGCCCCAGTGGGCGGGCGGTGCCGGGCAGGGCCACGAG 300 
CWRGAAWEPDGARWPGDRH 

E 



301 

TGGCACCCACCACGGGAGGCGCCGCTCCTCAAGCCAGGGCCAGTCTTAGGTCAACTGCCT 360 
GHTTGEAALLETGTLIWNV 

S 



361 

GGTGTCTACCACCCACTAGCTCGCCTACCACGGGGGCTCCAGCAGCTTCTCGGCCCGCTA 420 

FIG. 1A 



WLHHTISRITGGLDDFLRA 

I 



421 

GAGCCTGAACGGGGCCCGGTCTGGGGTGAACCCCTTCTTCTTCTGGCGCAGGAGCCGCTT 480 
ESKGRALGWKPFFFVADEA 

F 



481 

CATCAGCTAGCGCCCCCACGGCAGCGACGGCTGCGGCGGCAACAGCTTGCGGAACTTCAT 540 
YDIAPTGDSGVGGNDFAKF 



Y 



<orfup2 
541 

GCGCCACTACTGGCGGAACGCGACGAGCAGGCAGTATGGCCGGCTACGGTGCCTGTACTT 600 
ATIVAKRQEDTMGASAVSM 



601 

TGCTGGAGGTCTCTAAGGCCCACCGACACGACCCCGACGCCTTCCCCACAGGGGGCGCTT 660 



661 

CCTGCCGCCTGCGGCGCCTGCGGCGCCGGCAGAGGGGCCGCCTGCCCAGGGTCGCAGGAC 720 

* 

721 

CTCTCCCGAACCGCCGCCGAACTGCGGCACGACAGGGCGCCGAACGCCTTGCGCTTCATG 780 
781 

GCCGGTCGCATGCCCGCAACGTGGCCTGCACATGCGGCCAGCCCTGGGGAGCATGGGGGC 840 
841 

CTCGGCCGGCTGGGGCCGCCGAGGCCCCCATGCCTGCGCGGCCTGGCCGGGCTCGCTCGG 900 
901 

CCTGCCCAGCCTGCCACGCGCACCAAGGCCACACAGCCTGTCGAGCCTGCCTGGCCTGCC 960 



FIG. 1B 



961 

ACGCGCACCAAGGCCACACAGCCTGTCGAGCCTGCCCAGCCTGCCACGCGCACCAAGGCC 


1021 

GTGCGGCCTGCCCAGTCAACGGCTAGTACCGCTCGTTACGGCCCCACATGGCGAGGGGCC 

*NGIMALLAPTYR 

1081 

TGTGGCCCACCCTCTAGCGCCGGCAGTGGAGGCGCTCCCTGGCCAGCAGGTCGGCCTAGC 
SVPHSIAATVEALSRDDL 
* 

1141 

TCCGCCGCCGCTCTAACAGGCGCTCTACCCGGCCCAAGCGCCACGGGCCCTAGCCCTGCT 
SAAALNDALHAPNATGPI 

1201 

GCAGGAGCGGGGCCACCACGTCGGTCCGCTCGCGCTCGACACGGTCCCAGTCGGGGTCTG 
VDEGRHHLWALALQALTL 

1261 

GCAGGCGCTGGCCCGCGTCGGCCACGTCGTTGCTCGCCAACGCGCGCTCCCGGCCTCGCG 
GDAVPRLRHLLSRNRALA 

1321 

ACTTGGCCCCGACCGGGGCCGCCTTCAGGAGCAGGGGGTCTAGCAGCCACCAC3CCTACC 
SFRPQGRRFDEDGLDDTT 

1381 

ACGGCCACTCTTTTGGGGCAGGGTCTCCCCGCATTCGCTGCTAGGGCTAGGGGTCGAGGG 
TGTLFGRGLPAYAVIGIG 

1441 

CCGTCTGCCCGTGGTGGAGCAGGAGCTAGGGCGCGCTGGTGTCCGAGGTGAGCGAGACGT 
RCVPVVEDEIGRSWLSWE 

FIG. 1C 



1501 

GGCGGCAGTGGCCCACGTGGCGCAGGCGGGCCGCGTCGCACCGGCGCCTCCCGAGCCTCT 1560 
VATVPHVADARRLTAASPES 



1561 

CTGGCTCGGACGCCTGGAACGGGAGCGCGTGGTCGAGCCGGTGGCGTGGGTGCCAGAGGA 1620 
LGLRRVKGERVLEAVAGVTE 



1621 

GCTAGCCGTGGCGGCCCAGGCAGGTCACGACCATCATGTCCAGCTACGCCAGCCACGGCT 1680 
EIPVAPDTWHQYYLDIRDTG 



1681 

CTGCTGCGTCCCTGGCAAGCGTCCGGCGCGCCTGCATCCTGCCGAGCGGCGTGTTCGGGA 1740 
LRRLSRECAARVYSPEGCLG 



1741 

CCCTCCGCGGCAGCCTGCTCGCGTGGTACGGCTTGAACCACCGCTAGTCGTGGAGCAGGG 1800 
QSAGDSSRVMGFKTAILVED 



1801 

CCGCCGGGCGCTGGCGGGCAGGCTCGTCGAGGAGTGGCCGCGGCTCGGGGACCTGCAGCC 1860 
RRGAVARGLLEEGAGLGQVD 



1861 

GCCACAGGTCGTCCCACTGGGGCCGCAGCTGCCGCCGCGCCTACCACCGGCAGCGGGCCC 1920 
ATDLLTVGADVAARITATAR 



1921 

GCGCCAGGCCCGCAGGCATCTTCAGCCACCAGCCGTCCGTCGGCTCGGGGACCCGTGACT 1980 
ARDPRGYFDTTPLCGLGQAS 



1981 

GGCCTTCCAGGGCGTCCCGCGCCTGGCCGCCTGCGCCTTGGCGCCGCCTGTGCCTTGGCC 2040 
VPLDRLARVPPRPVAASVSG 



FIG. 1D 



<orfup1 

2041 

CCGGGGACTCGGGCGGAGAGCGGGACATACGGAACCTCCACAGGCGGAGCCGGGAACGGG 

GGCCCCTGAGCCCGCCTCTCGCCCTGTATGCCTTGGAGGTGTCCGCCTCGGCCCTTGCCC 
APSEPPSRSM 



2101 

ACGAGGGCGAGGACGGGACGGAACGAAGGAGAGGACGGGACGGACAGCACGGACGGGACG 
TGCTCCCGCTCCTGCCCTGCCTTGCTTCCTCTCCTGCCCTGCCTGTCGTGCCTGCCCTGC 



2161 

GACGGAACGGAGTCGGGAACCGGGGGGGGTGACCGGAACCGGGCCGTCCTTGGCCCTCCC 
CTGCCTTGCCTCAGCCCTTGGCCCCCCCCACTGGCCTTGGCCCGGCAGGAACCGGGAGGG 



2221 

CCGTCCTCCCCGCCATCCGCCGTTCTCCCCCGTTCCCTCTCCCGTCCTCCAGCCAACACC 
GGCAGGAGGGGCGGTAGGCGGCAAGAGGGGGCAAGGGAGAGGGCAGGAGGTCGGTTGTGG 



2281 

GCCGCCCTTTCCAAGCGCTTGACACGGCACCGACAGCCGCCGCCGGGCGCCCGATGGGGA 
CGGCGGGAAAGGTTCGCGAACTGTGCCGTGGCTGTCGGCGGCGGCCCGCGGGCTACCCCT 



2341 

CCCGTGCCCGCCGGTGAGCGGCGGTGAGCGCCGGTACGGGACCCCACGCGCCGCCGCCCG 
GGGCACGGGCGGCCACTCGCCGCCACTCGCGGCCATGCCCTGGGGTGCGCGGCGGCGGGC 



2401 

GGCGCCCGCCAGGGCCCGCGCGGCCACCCCGGCCCGCCCCGGCCGGAGCGGCGATCCGGG 
CCGCGGGCGGTCCCGGGCGCGCCGGTGGGGCCGGGCGGGGCCGGCCTCGCCGCTAGGCCC 



FIG. 1E 



2461 

CCGCTCGCTGCAAGAGGAACATCCACAGCCGCACAAGGAGCGCTCCGCACAGTGGGCACC 2520 
GGCGAGCGACGTTCTCCTTGTAGGTGTCGGCGTGTTCCTCGCGAGGCGTGTCACCCGTGG 



2521 

ACGTCCGCCCCGTCCCCCACACCGTGGCCGGTCCCCACCGGACAGCACAGCACCGCACAG 2580 
TGCAGGCGGGGCAGGGGGTGTGGCACCGGCCAGGGGTGGCCTGTCGTGTCGTGGCGTGTC 



2581 

CACCACATCGCACGGCACAGCACAGCACCACCGGCACGAGGAACCAAGGAAAGGAACCAC 2640 
GTGGTGTAGCGTGCCGTGTCGTGTCGTGGTGGCCGTGCTCCTTGGTTCCTTTCCTTGGTG 
cas1> 

MTSVDCTAYGPELRALAA 

2641 

ACCACCATGACCTCAGTGGACTGCACCGCGTACGGCCCCGAGCTGCGCGCGCTCGCCGCC 2700 
TGGTGGTACTGGAGTCACCTGACGTGGCGCATGCCGGGGCTCGACGCGCGCGAGCGGCGG 



2701 

CGGCTGCCCCGGACCCCCCGGGCCGACCTGTACGCCTTCCTGGACGCCGCGCACACAGCC 2760 
RLPRTPRADLYAFLDAAHTA 



2761 

GCCGCCTCGCTCCCCGGCGCCCTCGCCACCGCGCTGGACACCTTCAACGCCGAGGGCAGC 2820 
AASLPGALATALDTFNAEGS 



2821 

GAGGACGGCCATCTGCTGCTGCGCGGCCTCCCGGTGGAGGCCGACGCCGACCTCCCCACC 2880 
EDGHLLLRGLPVEADADLPT 

NcoI 

2881 

ACCCCGAGCAGCACCCCGGCGCCCGAGGACCGCTCCCTGCTGACCATGGAGGCCATGCTC 2940 



FIG. 1F 



TPSSTPAPEDRSLLTMEAML 



KpnI 

2941 

GGACTGGTGGGCCGCCGGCTCGGTCTGCACACGGGGTACCGGGAGCTGCGCTCGGGCACG 3000 
GLVGRRLGLHTGYRELRSGT 



3001 

GTCTACCACGACGTGTACCCGTCGCCCGGCGCGCACCACCTGTCCTCGGAGACCTCCGAG 3060 
VYHDVYPSPGAHHLSSETSE 



3061 



ACGCTGCTGGAGTTCCACACGGAGATGGCCTACCACCGGCTCCAGCCGAACTACGTCATG 3120 
TLLEFHTEMAYHRLQPNYVM 



3121 

CTGGCCTGCTCCCGGGCCGACCACGAGCGCACGGCGGCCACACTCGTCGCCTCGGTCCGC 3180 
LACSRADHERTAATLVASVR 



3181 

AAGGCGCTGCCCCTGCTGGACGAGAGGACCCGGGCCCGGCTCCTCGACCGGAGGATGCCC 3240 
KALPLLDERTRARLLDRRMP 



3241 

TGCTGCGTGGATGTGGCCTTCCGCGGCGGGGTGGACGACCCGGGCGCCATCGCCCAGGTC 3300 
CCVDVAFRGGVDDPGAIAQV 



3301 

AAACCGCTCTACGGGGACGCGGACGATCCCTTCCTCGGGTACGACCGCGAGCTGCTGGCG 3360 
KPLYGDADDPFLGYDRELLA 



3361 

CCGGAGGACCCCGCGGACAAGGAGGCCGTCGCCGCCCTGTCCAAGGCGCTCGACGAGGTC 3420 
PEDPADKEAVAALSKALDEV 



FIG. 1G 






3421 

ACGGAGGCGGTGTATCTGGAGCCCGGCGATCTGCTGATCGTCGACAACTTCCGCACCACG 3480 
TEAVYLEPGDLLIVDNFRTT 



3481 

CACGCGCGGACGCCGTTCTCGCCCCGCTGGGACGGGAAGGACCGCTGGCTGCACCGCGTC 3540 
HARTPFSPRWDGKDRWLHRV 



3541 

TACATCCGCACCGACCGCAATGGACAGCTCTCCGGCGGCGAGCGCGCGGGCGACGTCGTC 3600 
YIRTDRNGQLSGGERAGDVV 

A F T P R G * SacI 

3601 

GCCTTCACACCGCGCGGCTGAGCTCCCGGGTCCGACACCGCGCGGCTGAACCCACGGTCC 3660 
CGGAAGTGTGGCGCGCCGACTCGAGGGCCCAGGCTGTGGCGCGCCGACTTGGGTGCCAGG 



3661 




GGGGCCCACGGTCCGGCACCGCGCGGCTGAGCCCCCGGGTCCGGCAGCGGGCGGCTGAAC 3720 
CCCCGGGTGCCAGGCCGTGGCGCGCCGACTCGGGGGCCCAGGCCGTCGCCCGCCGACTTG 



3721 

CCCCGCCCCGGGCCACCGCCCGACCGCCCCCGCGCACCGGACGCGCCCGCCTGTACGGCG 3780 
GGGGCGGGGCCCGGTGGCGGGCTGGCGGGGGCGCGTGGCCTGCGCGGGCGGACATGCCGC 



3781 

GTCCCGCCCGGGCCCGTACACCTGAAGCGCCCGGCGGACCGCCGCCCCGCCGGGGGACGG 3840 
CAGGGCGGGCCCGGGCATGTGGACTTCGCGGGCCGCCTGGCGGCGGGGCGGCCCCCTGCC 



-> <- 



3841 

ACAGAGCCGGGTGCGGGAGGACGTCCTCCCGCACCCGGCTCCCACCGTTCCGCAGCGACC 3900 
TGTCTCGGCCCACGCCCTCCTGCAGGAGGGCGTGGGCCGAGGGTGGCAAGGCGTGGCTGG 

FIG. 1H 



3901 






GCACCCGACCGTGCCGCAGGCGCCACCGGCACCGCACCGCCCGCGCCGGCAGCCACCACA 3960 
CGTGGGCTGGCACGGCGTCCGCGGTGGCCGTGGCGTGGCGGGCGCGGCCGTCGGTGGTGT 

3961 

GGCGCCACGCCGCCCGCACGGTGCCCGCGCTGCTCAGCCCCCGTCCACCGGGCTGTCCAG 4020 

CCGCGGTGCGGCGGGCGTGCCACGGGCGCGACGAGTCGGGGGCAGGTGGCCCGACAGGTC 

*GGDVPSD 

L 



4021 

GTCGGCGGCGTCGCGCGGGGGCTACTTGAGGGCCAGCCGCCGGCTGGGGGGCCTGGGGCG 4080 
LRRLAGGIFERDAASGGSG 






4081 

CTCTACGGGGGTGTGAGGGCCCTAGTGGAGGTCGCTCCGTATGCCGTCGTCTAGCCGGTG 4140 
LHGWVGPIVELSAYPLLDA 

V 



4141 

GGCGAAGAGCAGGAGCTGCCGCTTTGTGTGCAGGTCCCGCGGGCCGTCGTGGTGCCGGGC 4200 
RKEDEVAFCVDLAGPLVVA 

R 



4201 

GCGGCACTGCCTCCGGTCGCGGCGGAGCTGCGAGGGGGGCCGGGGCCCACAGCGGGGGTG 4260 
ATVSALAAEVSGGAGPTAG 

V 

NcoI 

4261 

TAGGCACAAGAGGGTCCACGCGTGGTACCACTCGTCTAGGCGCCGCGGCCCGGGCCTCTC 4320 
DTNEWTRVMTLLDAAGPGS 



FIG. 1I 






4321 

CTTCTGGACGAGGGTCTTCGGCCACTCCATGAGGAGCGCCCACCGCTTTGGGTCGAGGGC 4380 
FVQEWFGTLYEERTAFGLE 

R 



4381 

CACCCGTGCCGCCCGGGTCTTCCTTGCGCTCCAGGGGGTGGGCCGCTTGTGGGCCGGGCG 4440 
HARRAWFSRSTGWGAFVRG 



4441 

GCGGAAGGCGGGGGCGAGGGGCCGCAGCCGCGACTCGCGGCGCCGGTCTGGCCTGTCGTC 4500 
AKRGREGADASLAAALGSL 

L 



4501 

CTGGTCCGACACGCCCGACGAGTGGCCGCGGGGCGTCTAGCCCCGCTAGGCCGCGTGGTA 4560 
VLSHPSSVPAGCIPAIRRV 

M 



4561 

GGGGCCTACGCTGTGCCGGGTGACCATCCGCACCCGGCGCGGGTAGCTGGTCGGGCACTG 4620 
GPHSVAWQYAHAAGMSWGT 

V 



4621 

GTCCCGGTCAAGGGCATGGGGGTCGAGGAGCCACTCGTCGGCCACGACGCGGCGCTGTAA 4680 
LALERVGLEETLLRHQAAV 

N 



4681 

CAGGACGCCTCACTAGTCGCCTTTCGCCCTGGGGCTGCCCACCAACGGCCCGCTCGACCT 4740 
DQPTILPFRSGSPHNGPSS 

S 



FIG. 1J 



4741 

CTGGGGCAACGGCTTCTCAGGCCGCCACTGCTGCGTCATGGCGGCCCACAGGTCGCCGTC 4800 
VGNGFLGATVVCYRRTDLP 

L 


4801 

GGGGCGTGGCTAGTCGGTCAGCATGGGCCACACCAGGGCCGGCTTCTTGCTGCCTGTCTC 4860 
GAGILWDYGTHDRGFFSPC 

L 

4861 

GTGGTGCAAGCAGGGCAGCCGCAAGCCGCACGGCATGTACCGCATTGGCTAGGCCCGCAG 4920 
VVNTGDANPTGYMAYGIRA 

D 

<orfdwn1 
4921 

GGCGTCCTGGAGGGGCAGGTCGTTGCCGTCAAGCAGCTAGAGCTTATACGCCGTAAGGTG 4980 
RLVEGDLLPLEDIEFIRCE 

M 


4981 

GCGACTGGAGGAACAAGCTAGGGGGGCCTGTTGTCCAGCCAGCACCGGCCTCTGAGTCTC 5040 
L 

5041 

GGTCAACCCCCGCTAGAGCCACCGGGTGTCGAGGTCCGACGCGTCGACCTGTAGCACGCC 5100 
WNPAIETAWLELSRLQVDH 

P 

5101 

CTAGTCGGGCCTCATGACCGTGACCTCGTCTATGAGGCCTAGCACGGCGAGGTGGTCGAA 5160 
ILGSYQCQLLYEPDHREVL 

K 

5161 

GAGCTAGTACGCCAACTACAGCAGGCCCCACGGCTGGGTGAGGTCGGGGGCCAGCTGGTC 5220 

FIG. 1K 



EIMRNIDDPTGVWELGRDV 

L 

5221 

CCAGAACATCAGGCTCGGCTAGCCTGGGCAGAGCGGCCAGCGCGCGTCGCGGAGCCACTT 5280 
TKYDSGIPGTEGTARLAET 

F 

NcoI 

5281 

CGGGTACCCCGGCTTGGTCAAGAGCTTCTACTTCGGCGGCGGCGCCCTGCGGGTCACCAC 5340 
GMPGFWNEFIFGGGRSAWH 

H 

5341 

CCGGAGCGGCCTCAGGGCCCTCTGGTCCTGCAGGAAGTAGTGGGGCTGGGCGAGCGGGGC 5400 
AEGSDRSVLVDKMVGVREG 

R 

5401 

GGCGTCCCACGGCACCGGGCGGCGGAGCCGGAGGAGGGCCATCTACAGGTAGTCGGCCCG 5460 
RLTGHGAAEAEERYIDMLR 

A 

* 

5461 

CTGCTAGACCAGCAGCCACAAGTAGTCCTAGCCGTGGTGCGGGAGGGCCCGTGTCTTGGC 5520 
VIQDDTNMLIPVVGERACF 

R 

5521 

CTTGCACAGGAGTGACTTCGACTTGCCGACCTTCTGCCCGCCCACCCCCGCGACCATCCC 5580 
FTDESFSFPQFVPPHPRQY 

P 

5581 

GAACCCGCGCTACGGGTGGAGCGCCTACTGCGGCAAGAGCAGCTCCGGGGCCGGCATCGC 5640 
KPAIGVERIVGNEDLGRGY 

FIG. 1L 



5641 

CGCGTGGCGGAGCATCCCCTTGAGGTCCAGGCCGTGGCCCTAGCAGGTGACGAGGGGCCT 5700 
RVAEYPFELDPVPITWQEG 

S 



5701 

CACCCACTTGCAGAGCCAGCAGGTGCGGAAGAACTACTAGAGGGTCACGAGGAGCTTCTC 5760 
HTFTETTWAKKIIEWHEEF 



5761 

CCGTGCTAACGCGGCCAGGGCGAGGGGCCGCAGCCTGTCCCACGGCGGCTGGGGCATGTG 5820 
ARNRRDREGADSLTGGVGY 

V 



5821 

GACGGGGTACTACAGCCGGGTCGCGAAGACCTTGGGCGCGCGCTAGGGCTGCTTCCGCGC 5880 
QGMIDAWRKQFGRAIGVFA 

R 



5881 

CGGGGCCCAGTACACCAGCTCGTAGCGGTCTAGGAGCCGGTCGGCGTCGCCTAACACGTC 5940 
GRTMHDLMALDEALRLPNH 

L 



5941 

GCCGTCCTGCAACCGGTAGACCGGCTGGGCCTACACGGCCCAGACGTACGGCTCCATCTC 6000 
PLVNAMQGVRIHRTQMGLY 

L 



6001 

GGGGTCGTACTAGCCCAACAACCTCTGGAGCTTTGGGAGCCACACCTTCACCACGAGCCA 6060 
GLMIPNNSVEFGETKFHHE 

T 



FIG. 1M 



6061 

CTTCCTGTCAGGGGTCATCGGCTCAAGCAGCCGGCGGACGCGGACGGCCCACTCGACGGC 6120 
FSLGWYGLEDAAQAQRTLQ 

R 



6121 

CTCGTACAAGACCATCAAGACGCCTAACTGGGGGCGGTATGGGGCGACCTGGACGCGTAC 6180 
LMNQYNQPNVGAMGRQVQA 

H 

<orfdwn2 

6181 

ACTGCCGACCGTTGGCAGATAGAAGAGAATGGACTTCACCCTGGCTCCTCCGGTTCGCGG 6240 

TGACGGCTGGCAACCGTCTATCTTCTCTTACCTGAAGTGGGACCGAGGAGGCCAAGCGCC 
SGVTPLYFLISKM 

6241 

CGCCCTCCATTGACGTGCGCCGAAAGCGGCTCGACCGTCCCACTCCGCCCTTGAGTTCCG 6300 

GCGGGAGGTAACTGCACGCGGCTTTCGCCGAGCTGGCAGGGTGAGGCGGGAACTCAAGGC 


6301 

TCTGACGCCGCGCCAGTCGGCGGGCCGTCCGCCGGGGTGCCCGCCGGGGTCCGCACCCGC 6360 

AGACTGCGGCGCGGTCAGCCGCCCGGCAGGCGGCCCCACGGGCGGCCCCAGGCGTGGGCG 


6361 

CGGACGGCACGGCGCGCACCGCGCGCGCGGCGCTTCGGGGCACCGGGCTCGACGGGGTGC 6420 
GCCTGCCGTGCCGCGCGTGGCGCGCGCGCCGCGAAGCCCCGTGGCCCGAGCTGCCCCACG 

6421 

TCAGCGGGACGTCCAACGGAAGGCAAGCCCCCGTACCCAGCCTGGTCAAGGCGCTCATCG 6480 
AGTCGCCCTGCAGGTTGCCTTCCGTTCGGGGGCATGGGTCGGACCAGTTCCGCGAGTAGC 

orfdwn3> 



FIG. 1N 



6481 

CCATTCCCTGAGGAGGTCCCGCCTTGACCACAGCAATCTCCGCGCTCCCGACCGTGCCCG 6540 
GGTAAGGGACTCCTCCAGGGCGGAACTGGTGTCGTTAGAGGCGCGAGGGCTGGCACGGGC 

6541 

GCTCCGGACTCGAAGCACTGGACCGTGCCACCCTCATCCACCCCACCCTCTCCGGAAACA 6600 
SGLEALDRATLIHPTLSGN 

T 


6601 

CCGCGGAACGGATCGTGCTGACCTCGGGGTCCGGCAGCCGGGTCCGCGACACCGACGGCC 6660 
AERIVLTSGSGSRVRDTDG 

R 


6661 

GGGAGTACCTGGACGCGAGCGCCGTCCTCGGGGTGACCCAGGTGGGCCACGGCCGGGCCG 6720 
EYLDASAVLGVTQVGHGRA 

E 


6721 

AGCTGGCCCGGGTCGCGGCCGAGCAGATGGCCCGGCTGGAGTACTTCCACACCTGGGGGA 6780 
LARVAAEQMARLEYFHTWG 

T 

6781 

CGATCAGCAACGACCGGGCGGTGGAGCTGGCGGCACGGCTGGTGGGGCTGAGCCCGGAGC 6840 
ISNDRAVELAARLVGLSPE 

P 

6841 

CGCTGACCCGCGTCTACTTCACCAGCGGCGGGGCCGAGGGCAACGAGATCGCCCTGCGGA 6900 
LTRVYFTSGGAEGNEIALR 

M 
6901 

TGGCCCGGCTCTACCACCACCGGCGCGGGGAGTCCGCCCGTACCTGGATACTCTCCCGCC 
ARLYHHRRGESARTWIL 

R 



6961 

GGTCGGCCTACCACGGCGTCGGATACGGCAGCGGCGGCGTCACCGGCTTCCCCGCCTACC 
SAYHGVGYGSGGVTGFP 

H 



7021 

ACCAGGGCTTCGGCCCCTCCCTCCCGGACGTCGACTTCCTGACCCCGCCGCAGCCCTACC 
QGFGPSLPDVDFLTPPQ 

R 



7081 

GCCGGGAGCTGTTCGCCGGTTCCGACGTCACCGACTTCTGCCTCGCCGAACTGCGCGAGA 
RELFAGSDVTDFCLAEL 

T 

Sau 

7141 

CCATCGACCGGATCGGCCCGGAGCGGATCGCGGCGATGATCGGCGAGCCGATC 
IDRIGPERIAAMIGEPI 



FIG. 1P 



